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Deep insight can be gained into the nature of nonclassical correlations by studying the quantum operations 
that create them. Motivated by this we propose a measure of nonclassicality of a quantum operation utilizing 
the relative entropy to quantify its commutativity with the completely dephasing operation. We show that our 
measure of nonclassicality is a sum of two independent contributions, the generating power - its ability to 
produce nonclassical states out of classical ones, and the distinguishing power - its usefulness to a classical 
observer for distinguishing between classical and nonclassical states. Each of these effects individually leads 
to quantum advantages in information processing. Our measure allows us to interpret quantum discord as the 
difference in superdense coding capacities between a quantum state and the best classical state when both are 
produced at a source that makes a classical error during transmission. 



Introduction. Identifying the resources that underlie quan- 
tum advantages in information processing is a crucial ques- 
tion of fundamental and technological importance. Generally, 
quantum entanglement is ascribed this role due to its neces- 
sity in a number of tasks providing quantum advantages [1,2]. 
However, quantum enhancements have been exhibited in cer- 
tain computations with limited amounts of entanglement or 
even none at all when the involved quantum state is mixed [3- 
7]. Universal quantum computation with pure states also ap- 
pears to be possible with little entanglement [8]. In addition to 
computational advantages, quantum communication can also 
exhibit advantages over classical communication in the ab- 
sence of entanglement [9-11]. 

Recently, it has been suggested that correlations beyond 
quantum entanglement might provide quantum enhance- 
ments. One of the most common quantities is the quantum 
discord [12-15]. Quantum discord has recently been inter- 
preted as the difference in the performance of the quantum 
state merging protocol between a state and its locally de- 
cohered equivalent, provided the decoherence is completely 
dephasing [16], as well as the entanglement consumption in 
quantum state merging [17]. The role of quantum discord in a 
more general family of protocols has also been studied [18]. 

An important difference between quantum discord and en- 
tanglement is that the latter is non-increasing, on average, un- 
der local operations and classical communication. This is the 
underlying principle of the resource theory of quantum en- 
tanglement. On the other hand, local operations can actually 
increase quantum discord [19-22]. Discordant states can be 
created out of classical states by a local channel if and only 
if the channel changes the local algebraic structure [23], and 
several authors have studied the evolution of quantum dis- 
cord under various forms of dynamics [24-28]. However, the 
principles underlying the creation of nonclassical correlations 
from quantum operations are still lacking. 

Here we investigate the nonclassicality of quantum oper- 
ations directly. Before presenting our results it is essential 
to clarify our notion of what is classical. Our criterion is 
based on einselection, or environment induced superselection, 
a process via which states of a quantum system become en- 



tangled with the environment, effectively measuring certain 
observables of the quantum system [29] . We will denote this 
process as T. Any operation CI of the form CI = T o CI o r, 
where o is the composition of operations, is certainly classi- 
cal since it always classicalizes the input and gives a classical 
output. Since T 2 = T, this implies the commutation relation 
CI o r = r o CI. Taking this relation as the foundation of our 
notion of classicality will be justified by its implications. We 
consider classical states p c to be the fixed points of the eins- 
election operator T so that T(p c ) = p c . Thus they are of the 
form p c = Y,afi Pafi |a) (a| ® \P) (j8|, where \a) , |j8) are the 
complete orthonormal eigenbasis of the einselection operator 
which acted on both parties of the bipartite system. The op- 
erator r may also act only on one subsystem, in which case 
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FIG. 1: Illustration of Thm. (1). The dashed lines represent relative 
entropies corresponding to the terms from Thm. 1, while the solid 
lines represent operations. The two paths from the state CI o T(p) to 
ToCl(p) are equidistant in relative entropy. Quantities (i) and (ii) 
represent the generating and the distinguishing power, respectively. 
We consider classical states to be the fixed points of the linear ein- 
selection operator T and as such is a simplex. This set is smaller 
than the set of separable [1,2] and zero-discord states [7]. Note that 
our notion of classicality is stricter than that enforced by quantum 
discord since there is no freedom to choose the classical basis. 
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we get one-sided classicality with invariant states of the form 
P = JLaPaPa ® \oc) (cc\. While all classical states have zero 
discord, not all zero discord states are classical in the sense 
used here. A classical observer is one who can measure only 
in the einselected basis, and for whom the quantum state p is 
completely indistinguishable from the state T(p). Such states 
have identical diagonal elements in the einselected basis, but 
differ in the off-diagonal elements. In contrast a quantum ob- 
server may be able to distinguish between the states p and 
T(p), given enough copies. 

In this Letter, we introduce a measure of quantumness of 
operations that applies to all completely positive maps. Our 
definition is inspired by the fact that a classical map com- 
mutes with the einselection operation T. The extent of non- 
commutativity is measured using the quantum relative entropy 
between two different orderings of the dephasing operation T 
and an operation CI. We show this measure to be composed 
of two independent contributions - firstly, the ability of a non- 
classical map to produce non-classical states from classical 
ones, and secondly, the degree to which it enables classical 
observers to distinguish states they could otherwise not dis- 
tinguish classically. We show how these two contributions 
can provide quantum enhancements in information process- 
ing protocols in and of themselves. Our measure possesses 
several intuitive properties such as being non-increasing un- 
der composition with classical maps and being convex. We 
calculate our measure for some entangling and correlating op- 
erations and then apply it to interpret quantum discord via the 
capacity of superdense coding with noisy states. 

We define the quantumness of an operation CI as 

Wr(£i) = sup S(C1 or(p)||ro£l(p)) , (1) 
P 

where S(p\\a) = Tr [p(log(p) — log(a))] is the quantum rel- 
ative entropy [30], and all logarithms in this paper are base 
2. The supremum in Eq. (4) is taken over all quantum states, 
but as shown in the Appendix it is sufficient to maximize over 
pure states only. The quantity Wr thus intuitively measures 
the deviation of the commutator [fi, T] from zero by apply- 
ing both orderings to the same state and comparing the out- 
puts [39]. The definition of implicitly depends on T, how- 
ever we will suppress this in our subsequent discussions. We 
now present our main result. 

Theorem 1. The quantumness of an operation W(Ci) is the 
sum of two independent contributions 

w(ci) = sup( k s(ronor(p)||ron(p))+ 
p V 

s(nor(p)||ronor(p))), (2) 

where the supremum is over all quantum states. 

The first term, which we call the distinguishing power, 
characterizes how well a classical observer can distinguish be- 
tween CloT(p) and fl(p). The second term, which we call 



the generating power, measures the ability of the map CI to 
generate a nonclassical state out of a classical input. This is 
depicted in Fig. (1). Following Klein's inequality, W(C1) van- 
ishes if and only if the operation CI obeys [Cl,T\ =0 and so is 
classical. An implication of Thm. (1) is that an operation CI 
is classical only if it has neither distinguishing nor generating 
power. The proof of Thm. (1) follows from the monotonicity 
of relative entropy under completely positive maps [32] and is 
provided in the Appendix. 

Crucially, the distinguishing and generating powers in 
Eq. (2) can be independently zero. Given a quantum operation 
CI where both these quantities are non- vanishing, we can con- 
struct an operation r o CI for which the second term in Eq. (2) 
vanishes but the first term is unchanged. Thus, W(ToCl) is the 
maximum distinguishing power of CI. On the other hand, for 
the operation CloT, the first term of Eq. (2) vanishes while the 
second one is unchanged and therefore W(CloT) is the max- 
imum generating power of CI. By definition, if the operation 
CI is classical, both these quantities must be zero. 

There are instances where both terms play essential and in- 
dependent roles in a quantum protocol. As an example, con- 
sider the BB84 quantum cryptography [33]. In order to en- 
gage in the protocol, Alice must be able to prepare states in 
two non-orthogonal bases, which requires only the power to 
generate non-classical states, implying non-vanishing gener- 
ating power. Bob, on the other hand, needs to be able to distin- 
guish between classical and non-classical states in order to ex- 
tract the key and detect the presence of an eavesdropper, thus 
requiring an operation with non-zero distinguishing power. 

The measure of quantumness W(Ci) has some additional 
properties which are physically intuitive, such as 

(PI) Extremality: Maximum in the supremum is attained 
with a pure state. This similarly follows from the joint con- 
vexity of relative entropy. 

(P2) Monotonicity: Given a general operation CI and 
a classical operation Cl c , then W(C1 C o CI) < W(Ci) and 
W(C1 o Cl c ) <W(C1) holds, showing that the measure is non- 
increasing under composition. 

(P3) Convexity: The convexity follows from the joint con- 
vexity of relative entropy. Thus, given two observers with 
classical maps Clf,Clf at their disposal, and shared source 
of randomness, they cannot create a nonclassical operator. 
In other words, if W(Clf <g> 1) = and W(l <8> Clf ) = then 

wx iPi cit®a?) = o. 

The proofs of the above properties are given in the Ap- 
pendix as Thm. 2-4. We next evaluate our measure for com- 
mon decoherence channels, a local discord-generating and 
nonlocal entanglement-generating operation. 

Examples. Here we focus on qubits with a classical basis 
as |0),|1) and T implementing two-sided einselection. For 
unitary operations U we have that W(U) = if and only if 
U is a combination of a classical permutation matrix of the 
classical basis states with phase shifts. This is proved in 
the Appendix as Thm. 5. For standard qubit error models 
[33], we similarly find that if the errors occur in the classi- 
cal basis then they have vanishing W. Since Pauli matrices 
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are permutations of the classical basis up to a phase, such 
models include any Pauli channels on a single qubit, such as 
the depolarising, bit-flip, phase-flip and the phase-damping 
channel. The measure W also vanishes for the amplitude- 



damping channel, S y (p) = F\pF\ + F 2 pF J 2 , where F\ = 
|0> <0| + vT=y|l)(l| and F 2 = y/y\0)(l\, since its Kraus 
operators correspond to permutation matrices, up to a phase. 

However, if we change the basis away from the classi- 
cal basis, quantumness may arise. For example, consider 
H o S o H in which the amplitude damping channel sand- 
wiched between Hadamard gates H. In this case the measure 
W is maximized for the state |0), at which point only the gen- 
erating power contributes, giving the value W(H 0E0H) = 
log(fl) + (fl/2)log((l +a)/(1 -a)) + (fr/2)lo g((l +&)/(! - 
b)), where a = \Jl-J an d b = \J y 1 ■ - y+ 1. Notice that al- 
though the amplitude damping channel is completely classical 
on its own, removing it from the sequence of operations would 
leave only HoH = 1, making the sequence classical. 

On the other hand, the Hadamard gate H on its own at- 
tains infinite quantumness. The maximum is attained for the 
states |±) = (|0) ± 1 1))/ y/2, for which generating power van- 
ishes and the distinguishing power is infinite, owing to the 
logarithm in the definition of relative entropy. Infinite quan- 
tumness tells us that the Hadamard gate can be used to ascer- 
tain that the local state is classical p = (|0) (0| + |1) (l|)/2 
and not |+) with certainty in a finite number of measure- 
ments on average. The maximum W continues to be attained 
for |+) even when H is followed by a depolarizing chan- 
nel, A JU (p) = jup + (1 — n)l/d, where d is the dimension 
of the Hilbert space, although it is no longer infinite. The 
non-classicality of two unitary operations Cl\ and Cl 2 with 
infinite W can be compared through the use of regulariza- 
tion. For example using a depolarising channel and evaluating 
lim^i W(A jl o&i)/W(An o&2)- The depolarising channel 
thus acts as a regulator and the correct ratio is obtained in the 
limit where the regulator becomes the identity. This can give 
us a meaningful ratio of the quantumness of any two opera- 
tions whenever W diverges. 

An example of a local channel generating non-classical 
correlations is given in Ref. [19]. The map is of the form 
a = 1 <g)fl fi , where ft fi (p) = E\pE\ + E 2 pE\ and E\ = 
|0) (0|, E 2 |+) (1| [19]. Applying the map CI to the clas- 
sical state a c = i(|0) (0| <8> |0) (0| + |1) (1| <g> |1) (1|) leads to 
p = i(|0)(0|® |0)(0| + |1)(1|® |+)(+|), which has non- 
zero discord but vanishing entanglement. By computing 
S(aor(a c )||ron(a c )), we find W(Cl) = 1, show that this 
map is indeed nonclassical. Moreover the distinguishing 
power of this map vanishes for all states p . 

Next we look at an entangling operation, specifically a 
CNOT controlled in the |±) basis capable of generating a 
maximally entangled two-qubit state. As shown in Fig. 2 we 
find that when this is followed by a joint two-qubit depolaris- 
ing channel A^ below jl = 2/3 W is maximized by the gen- 
erating power alone, while above p, = 2/3 it is maximized 
purely by the distinguishing power. Thus, as this critical point 



is crossed the maximum quantumness of this noisy CNOT op- 
eration switches from being exposed by its ability to generate 
nonclassicality to its ability to distinguish nonclassicality. 

An interpretation of quantum discord. Suppose Alice and 
Bob would like to perform superdense coding using one of 
two types of states ordered from a source, either a quantum 
state p or a cheaper, completely dephased version T(p). How- 
ever, regardless of what they order, they actually receive fl(p) 
or CloT(p), respectively, which accounts for transmission 
imperfections. The question we now ask is how much addi- 
tional information can they transfer using the superdense cod- 
ing protocol if they ordered the quantum state p rather than 
T(p). We will show that if H is classical so W(Ci) = 0, then 
the capacity difference is equal to precisely quantum discord 
of £l(p), where W is evaluated with r = 1 <g> Tb acting on the 
receiver's (Bob's) side. This one-sided einselection operator 
is used to match the definition of the standard quantum dis- 
cord [12, 13]. While this result holds in general, for simplicity 
we assume that p = \& d ) 1, the maximally entangled state 
\®d) = Ha \ a ) ® \oc) I \fd. In this case T(p) is the maximally 
classically correlated state. 

The capacity of superdense coding [34] using a state p is 
given by F(p A \ B ) = log(d A ) - S(p A \ B ), where S(p A \ B ) de- 
notes the conditional entropy of the state p [40]. Zurek's 
original definition of quantum discord [36] is Q z (p A \ B ) = 
LaPaS(p^) - S(p A \ B ), where pj* is the marginal state on Al- 
ice's side given that outcome a was obtained, correspond- 
ing to the rank-1 projector II a . Using basic properties 
of von Neumann entropy [33], we have that Q z (p A \ B ) = 
S(T(p A \ B )) - S(p A \ B ). Assuming [r, Q] = 0, then gives 

F(a\<P d ) (^ B )-F(aor\^ d ) (^ B ). (3) 

Extending this to the usual definition of quantum discord Q 
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FIG. 2: Quantumness of CNOT controlled |±) basis, followed by 
the depolarizing channel as a function of jl. We maximized W 
and split the expression into the generating power, green line, and the 
distinguishing power, dashed red line. When /I is small the action of 
the depolarizing channel is large and degrades distinguishability to 
such an extent that the generating power dominates. When the — > 
1 , on the other hand, generating power is fundamentally bounded by 
log (d) and thus can no longer compete with the distinguishing power 
which experiences unbounded growth. The maximum changes from 
the generating to the distinguishing power at = 2/3. 
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[12, 13] which involves a minimization over II a , Eq. (3) 
transforms into Q(C1 \<S> d ) <<^| A|5 ) = F(C1 \<S> d ) <<^| A|5 ) - 
sup r F(flor|<l></) (<£j| A|5 ). Thus quantum discord is the dif- 
ference in the capacity of superdense coding using the maxi- 
mally entangled state and the best possible classically corre- 
lated state. Our results show that quantum advantage can be 
gained over the initially classical state in the presence of noise 
even when fl(|4>^) (®d\) is unentangled. This is illustrated in 
Fig. (3) where CI = is the depolarising channel. 

Discussion. In this Letter we have proposed a measure of 
nonclassicality of quantum operations. The measure is a sum 
of two independent contributions, the generating power and 
the distinguishing power, which characterizes an operation as 
non-classical if and only if the operation can be used by clas- 
sical observers to distinguish between quantum and classical 
states or creates non-classical states out of a classical states. 

Our measure satisfies several intuitive properties such 
as convexity and monotonicity under composition of clas- 
sical maps. In addition, our results show that the ein- 
selected relative entropy of discord Q g (p) = S(p\\T(p)) , 
is non-increasing under the action of classical maps. 
This is seen by observing that for a classical opera- 
tion Cl c , we have Q g (Cl(p)) = S (Cl c (p)\\T o Cl c (p)) = 
S(a c (p)\\a c or(p)) < S(p\\T(p)) = Q g (jy) by monotonic- 
ity of relative entropy. 

Furthermore, it is interesting to note that there is a natu- 
ral complementarity between quantumness of operations and 
quantumness of states. In particular, we could define a 
measure of quantumness of states through W as Qw(p) = 
inf (CI), where the minimization is over all operations CI 
generating the state p. It is clear from Thm. (1) that quan- 
tumness of any operation generating p must satisfy W(Cl) > 
Q g (p). We conjecture that Qw(p) = Q g (p). Finally, our mea- 
sure W might also be extended beyond completely positive 
maps if maximization is performed over the set of compatible 
classical states [37, 38]. 
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Supplementary Material for 
"Quantifying the nonclassicality of operations" 

This supplementary material contains the proofs of the main result Theorem 1 and Properties P1-P3, along with a derivation 
of the value of our nonclassicality measure W for unitary maps. 



PROOF OF THE MAIN RESULT 

Theorem 1 relates to the decomposing our measure W of non-classicality into two contributions. Specifically, 
Theorem 2. The quantumness of an operation W(Ci) is the sum of two independent contributions 

w(G) = sup^5(roftor(p)||rofl(p)) +s(a r(p)||roa r(p))^, (4) 

where the supremum is over all quantum states. 

Proof We start with the relative entropy, defined as S (p || a) — Tr [p(log(p) — log(a))], featuring under maximization in W(C1). 
Then we insert the sum of a complete set of orthonormal projectors £ a II a = 1, where II a are the Kraus operators of T. We 
thus obtain 

s(£ior(p)||roO(p)) = -s(Oor(p))-Tr[£ior(p)iog(ro£i(p))] (5) 
= -s(Oor(p))-Tr[£n a Oor(p)iog(roQ(p))]. (6) 

a 

where S(p) = — Tr[plog(p)] is the von Neumann entropy. Next we use the fact the projective property (X a II a ) 2 = 
together with the cyclic property of the trace and the fact that II a commutes with To ft(p) and thus also with its logarithm. The 
above is then transformed to 

s(£ior(p)||roQ( P )) = -s(Oor(p))-TrEn a Oor(p)n a iog(roO( P ))] (7) 

a 

= -5(ftor(p))-Tr[roftor(p)iog(roO(p))]. (8) 

Next we add and subtract ^(roHo T(p)) to the righthand side to get to 

s(Qor(p)||roQ(p)) =5(roQor(p)) -5(Qor(p))+5(roQor(p)||roQ(p)). (9) 

We now expand the entropy ^(roQo T(p)) to give 

5(flor(p)||rofl(p)) = -Tr[roftor(p)iog(roflor(p))] -5(flor(p)) + k s(ronor(p)||ron(p)). (io) 

Now we expand T and insert back the orthogonal projective operators II a yielding 

s(Oor(p)||ro£i(p)) = -Tr[£n a £ior(p)n a iog(ro£ior(p))] - 5(£ior(p)) +^(ro^or(p)||ro^(p)). (ii) 

a 

Now because II a commutes with r o CI o T, we find that 

s(£ior(p)||roO(p)) = -s(£ior(p)) -Tr[£n a Oor(p)iog(roaor(p))]+5(roaor(p)||roQ(p)). (12) 

a 

The first two terms then form another relative entropy, leading us to 

5(flor(p)||roft(p)) =s(ronor(p)||roa(p)) +s(a r(p)||ronor(p)). (13) 

Inserting the supremum over p then forms W(Cl) and completes the proof. □ 
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PROOFS OF PROPERTIES P1-P3 



Here we give the proof of property PI that the maximum in the maximization for W is always attained for a pure state. 

Theorem 3. Given sup p S(ftor(p)||ro£2(p)), there exists a pure state \\jf) (\jf\ such that the supremum in equation (4) is 
attained when p = \\jf) 

Proof. Imagine that we have performed maximization over only the set of pure states and found that the maximum is attained 
for | \jf) . Then for some mixed state p we can spectrally decompose it as p = £y jij | (j)j) (<j>j | , where | are it eigenstates. Since 
the relative entropy is jointly convex in its arguments [33] this implies that 

s(nor(p)||ron( P ))<£^s(nor(|fc)^ 

j 

This completes the proof. □ 

Next we will consider the property P2, stating that the measure W is non-increasing under the composition with classical 
maps. 

Theorem 4. If a is some map and W(Cl c ) = then W(Ck c o a) < W(Cl) and W(Q o Q c ) < W(G) . 
Proof Notice that 

w(Q c oa) = sups(a c oQor(p)\\Q c oroci(p)) < sup k s(nor(p)||ron(p)) = w(a), (U) 
p p 

where the last inequality is due to the monotonicity of relative entropy under completely positive operations (and thus also the 
strong subadditivity of the von Neumann entropy, which is equivalent to monotonicity [32]). For the reverse order 

w(£ioa c ) = sups(Ooron c (p)||roflon c (p)) 
p 

= sup s(nor(p)||roa(p)) 

< sups(aor(p)||ron(p)) = w(a), (15) 
p 

where going from second to the third line we changed the set over which we take supremum from all states to the set of states of 
the form ft(p). Since this set is entirely contained in the set of all states, the inequality follows. □ 

Finally, the property P3 is proved in the following theorem. 

Theorem 5. Given a set of local operations O^, £1^ such thatW(Q^ <g> 1) = andW(l <g> Ofy = then W(&) — Ofor any local 
operation with shared randomness of the form CI = £ w y w O^ <8> Q^. 

Proof Notice that since we required that T be composed of local orthonormal projectors we can write it as T = T A (g) T B in the 
bipartite case. Given that £1^ and commute with T A and T B , respectively, we also have that £1 commutes with T, establishing 
the result. □ 



QUANTUMNESS OF UNITARIES 



Now we turn our attention to the proof of the statement regarding the quantumness of unitary operations. 

Theorem 6. When T acts on the entire joint Hilbert space, selecting a complete orthonormal classical basis \k), we have for 
any unitary operation U that W(U) =0 if and only ifU = Y.k e ^ k W) (^1^ where P is a permutation of the classical basis and 
<j>k are phases. Otherwise W(U) = °°. 

Proof. We proceed by computing the relative entropy S(U o T(p) \\ToU(p)). First we show that if U is not of the required 
form, then W(U) = °°. Under such assumption, there exists a non-classical state 1 0o) such that U |0o) = \j), where | j) is any 
classical basis state. The relative entropy S(p\\a) is infinite due to the term Tr[plog(cr)] when the kernel of a has a non-zero 
overlap with the support of p. So suppose 1 0o) = Hk a k \k) is me expansion of |0o) in the classical basis. Then r(|0o) (00 1) = 
Y,k \ a k\ 2 \k) (k\ |0o) (0o I , since |0o) is not classical by assumption. Thus, UT(\(j)o) ((j)o\)U^ will in general have support across 
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numerous classical states besides \j) However, the second argument of the relative entropy is T(U \(j>o) ((j>o\ U^) = \j) (j\ and 
thus is a state with a kernel overlapping with the support of UT(\(j)o) ((j>o\)U\ making the relative entropy infinite. 

Secondly, we show that if W(U) = «>, then some classical state \k) is mapped to a non-classical state. Since the generating 
power is the einselected relative entropy of discord of the output state, we know that it must be bounded by log (J), where d is 
the dimension of the joint Hilbert space. Therefore, if W(U) = °°, the distinguishing power is infinite. There exists a state \\jf) 
such that 

s(r(ur(\ w )( w \)ui)\\r(u\ w )( w \ui)) =~. (16) 

Now \\jf) cannot be classical, otherwise the above would vanish. So let \y/} = Y^kTk \k) an d F(l V") (v\) = Y,k \7k\ 2 \k) (k\. Label 
U\k) = |%>. Then we have that r(i/r(|vA)(vA|)i/ t ) = Lk\Yk\ 2 n\Vk) (¥k\) and T(U \ ¥ ) (y\ = £ M y k ^r(\ ¥k ) Thus 
we see that if \\j/k) were a ll classical, then the distinguishing power would vanish. Therefore we must have that at least one of 
the states \ Yk) is not classical, showing that U is not of the form in the theorem statement. We have thus shown that U is not a 
permutation matrix up to a phase if and only if W(U) = °°. Conversely, when U is a permutation matrix up to a phase, we know 
that W vanishes. This completes the proof. □ 



